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Abstract— The effective assembly of an antibody molecule requires the proper association of the light and 
heavy chains, namely the tight, canonical association of VH with VL, and of CHI with CL. In this paper 
the interaction of CHI is examined by looking at the degree of conservation of residues in the interface 
between CHI and CL, where CHI can belong to any of the heavy chain classes, and CL can be either 
lambda or kappa. The three-dimensional structures of four antibody Fabs have been examined to see 
which are the significant interacting residues and to see whether they a)so correspond to the conserved 
residues in the different classes. It was found that there are a few hydrophobic residues buried in the 
interface which make numerous contacts with residues of the other chain and which remain invariant, 
or else are highly conserved. Around the periphery of the interface there are numerous interacting residues 
that have appreciable variability. Within the interface there is a cavity, the function of which may be to 
permit some changes in the central interface residues while still preserving the same relative orientation 
of CHI and CL. 



INTRODUCTION 

In an antibody-producing cell large amounts of anti- 
body light and heavy chains are synthesized and are 
assembled into the complete four-chain antibody 
molecule. The work of Dorrington and his colleagues 
(Klein et al., 1979) has demonstrated that the proper 
formation of the Fab requires the interaction of the 
two constant domains, CH 1 and CL. However, each 
CL, whether lambda or kappa, must be capable of 
combining effectively with each CHI, whether the 
CHI is alpha, gamma, delta, epsilon or mu. The 
simplest way of achieving this result would be to have 
the sequence of the domains, or at least the interface 
residues of the domains, remain invariant through the 
different isotypes, but a quick examination of the 
observed sequences shows that this is not the case. 

In order to identify the interacting residues in the 
interface between CL and CHI, the crystal structures 
of four Fabs have been examined. The antibodies in 
these crystals consisted of two human IgG lambda 
(KOL and NEW) and two mouse IgA kappa 
(McPC603 and J539) molecules. The results of this 
examination were then extended to the other classes 
by aligning the sequences with those of these four 
Fabs. The degree of conservation of these interface 
residues in the different classes was then determined. 
In the following text the results of this investigation 
are presented and some unexpected observations that 
were made are described. 

MATERIALS and methods 

Atomic co-ordinates 

The atomic co-ordinates for McPC603 were from 
a structure refined at 2.7 A resolution (Satow et al., 
1986); those for J539 were from a refinement at 2.6 A 



resolution (Suh et al., in preparation). The atomic 
co-ordinates for NEW (Saul et al., 1978) and KOL 
(Marquart et al., 1980) were obtained from the 
Protein Data Bank (Bernstein et al., 1977). 

Protein sequences 

The amino acid sequences were obtained from 
Kabat et al. (1983). The human CHI sequences 
(Table I) were those for EU (Sequence No. 1), TRO 
(No. 34), WAH (No. 42), IgE'CL (No. 31) and GAL 
(No. 38) for human IgG, IgA, IgD, IgE and IgM, 
respectively. The corresponding mouse sequences 
that we use here were those obtained from translation 
of the nucleotide sequence of cloned genomic DNA, 
specifically sequence Nos 45, 60, 73 , 70 and 65 for 
IgG, IgA, IgD, IgE and IgM, respectively. 

The human CL sequences (Table 2) were those for 
TI (Sequence No. 1) and for NEWM (No. 16) for 
human kappa and lambda chains, respectively. The 
corresponding mouse sequences were those for 
MOPC21 (No. 23) and PLAI-13'CL (No. 31). The 
numbering scheme of Kabat et al. (1983) is used 
throughout this paper. 

Structural comparisons 

The CL:CH1 pairs of domains were structurally 
aligned by a least-squares superposition of the 
various pairs with those of McPC603. This was 
accomplished using program ALIGN (G. H. Cohen, 
unpublished) which is described elsewhere (Satow et 
al., 1986). Only main chain atoms were used here in 
the structural alignments. 

Sequence alignment 

The isolated CL and CHI domains of McPC603 
and KOL were superposed using program ALIGN 
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o acid sequence of the CHI domain from human a- 



ALA ALA ALA GLU ALA GLY ALA — GLY 

SER LYS SER SER PRO ASP SER SER SER 

THR THR PRO ALA THR LYS THR !LE ALA 

LYS THR THR ARG LYS LYS GLN ARG SER 

GLY PRO SER ASN ALA GLU SER TRP ALA 

PRO PRO PRO PRO PRO PRO PRO PRO PRO 

SER SER LYS THR ASP ASP SER GLU THR 

1 VAL 1 VAL VAL ILE VAL MET VAL LEU LEU 

37PHE50 TYR PHE 40 TYR 46 PHE PHE PHE TYR PHE 

8 PRO 17 PRO PRO 13 PRO 15 PRO LEU PRO PRO PRO 

24 LEU 55 LEU LEU 21 LEU 50 ILE LEU LEU LEU LEU 

3 ALA 3 ALA SER II THR 11 ILE SER THR LYS VAL 

PRO PRO LEU 27 LEU 24 SER GLU ARG PRO SER 

SER GLY CYS 9 PRO 15 GLY CYS CYS CYS CYS 

SER SER SER PRO CYS LYS CYS - GLU 

5 LYS 2 ALA THR 1 ALA 3 ARG ALA LYS LYS ASN 

— — — 3 LEU 8 



SER PRO PRO ASN GLY SER PRO 

THR GLN PRO SER LYS GLU ILE THR ASN LEU 

— - _ _ — _ PRO _ _ - 

SER THR ASP — ASP GLU SER ALA PRO SER 

GLY ASN GLY — ASN ASN ASN SER SER ASP 

GLY SER ASN ASP SER GLU ALA MET SER LYS 

THR MET VAL PRO PRO LYS THR THR THR ASN 

— __ _ _____ LEU 

ALA VAL VAL VAL VAL ILE SER — VAL VAL 

8 ALA 10 THR — 15 ILE 25 VAL ASN VAL — ALA ALA 

— — — — — — THR — — — 

5 LEU 8 LEU ILE 3 ILE 9 LEU LEU LEU LEU VAL MET 

1 GLY 3 GLY ALA GLY ALA GLY GLY GLY GLY GLY 
CYS CYS CYS CYS CYS CYS CYS CYS CYS CYS 

9 LEU 37 LEU LEU 2 LEU 29 LEU LEU LEU LEU LEU LEU 
VAL VAL VAL ILE ILE VAL ALA VAL ALA ALA 

12 LYS 17 LYS GLY 1 HIS 3 THR ILE THR LYS GLN ARG 

ASP GLY GLN ASP GLY GLY GLY ASP ASP ASP 

TYR TYR PHE TYR TYR — TYR TYR PHE PHE 

PHE PHE PHE PHE HIS - PHE PHE LEU LEU 

PRO PRO PRO PRO PRO SER PRO PRO PRO PRO 

GLU GLU GLN SER THR GLN GLU ASN ASP SER 

— — GLN GLY ______ 

PRO PRO PRO THR SER PRO PRO PRO SER THR 

VAL VAL LEU MET VAL LEU VAL VAL ILE ILE 

THR THR SER ASN THR LYS MET THR THR SER 

VAL VAL VAL VAL VAL ILE VAL VAL PHE PHE 

SER THR THR THR THR SER THR THR SER THR 

TRP TRP TRP TRP TRP TRP TRP TRP TRP TRP 

ASN ASN SER GLY TYR GLU ASP TYR LYS ASN 

— __ _ ____ TYR TYR 

_ __ _ _____ GLN 

SER SER GLU LYS MET PRO THR SER LYS ASN 

GLY GLY SER SER GLY LYS GLY ASP ASP ASN 

ALA SER GLY GLY THR LYS SER SER ASN THR 

_ __ _ ______ GLU 

LEU LEU GLN LYS GLN SER LEU LEU SER VAL 

THR SER GLY ASP SER SER ASN ASN ASP ILE 

— __ _ ____ILE — 

SER SER VAL ILE GLN ILE GLY MET SER GLN 

GLY GLY THR THR PRO VAL THR SER SER GLY 

VAL VAL ALA THR GLN GLU THR THR THR ILE 

11 HIS 16 HIS ARG 5 VAL 12 ARG HIS MET VAL ARG ARG 
THR THR ASN 1 ASN 2 THR VAL THR ASN GLY THR 

42 PHE 70 PHE PHE 45 PHE 72 PHE PHE LEU PHE PHE PHE 

5 PRO 32 PRO PRO 10 PRO 27 PRO PRO PRO PRO PRO PRO 

2 ALA 2 ALA PRO PRO 2 GLU SER ALA ALA SER THR 
16 VAL 44 VAL SER 6 ALA 28 ILE GLU THR — VAL LEU 

_ _ _ _ _ _ THR — — — 

1 LEU 1 LEU GLN LEU 1 GLN MET LEU LEU LEU ARG 

12 GLN 13 GLN ASN 5 ALA 4 ARG ARG THR GLY ARG THR 

6 SER 4 SER ALA SER ARG ASN LEU SER GLY GLY 
SER ASP SER GLY ASP GLY SER GLU GLY GLY 
GLY — GLY GLY SER ASN GLY — LYS LYS 

_ _ASN — _______ 

LEU LEU LEU ARG TYR — HIS LEU — TYR 

TYR TYR TYR TYR TYR TYR TYR LYS TYR LEU 

5 SER 14 THR THR 2 THR 23 MET THR ALA VAL ALA ALA 

5 LEU 10 LEU THR MET 10 THR MET THR THR ALA THR 

13 SER 33 SER SER 5 SER 32 SER VAL ILE THR THR SER 
SER SER SER ASN SER LEU SER SER SER GLN 

6 VAL 21 SER GLN 30 GLN 49 GLN GLN LEU GLU GLN VAL 
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Table 1. (Continued) 

VAL VAL LEU LEU LEU VAL LEU VAL VAL LEU 

THR TOR THR THR SER THR THR THR LEU LEU 

VAL VAL LEU LEU THR VAL VAL — LEU SER 

PRO PRO PRO PRO PRO LEU - SER PRO PRO 

SER SER ALA ALA LEU ALA SER TRP SER LYS 

SER SER THR VAL GLN SER GLY GLY LYS SER 

17 SER PRO GLN GLU GLN GLU ALA LYS ASP ILE 

— — CYS — — — — — VAL — 
>8 LEU ARG LEU CYS TRP LEU TRP SER MET LEU 

_ _ _ — ____ GLN GLU 

19 GLY PRO ALA PRO ARG ASN ALA ALA GLY GLY 

K> THR SER GLY GLU GLN LEU LYS LYS THR SER 

)2 — — — GLY — — _ 

)3 GLN GLU LYS GLU GLY ASN GLN ASN ASN ASP 

)5 THR THR SER SER GLU HIS MET GLY GLU GLU 

)6 TYR VAL VAL VAL TYR — PHE — HIS TYR 

— — — — - VAL LEU 
)7 ILE THR THR LYS LYS THR THR THR VAL VAL 

CYS CYS CYS CYS CYS CYS CYS CYS CYS CYS 

ASN ASN HIS SER VAL THR ARG HIS LYS LYS 

10 VAL VAL VAL VAL VAL ILE VAL VAL VAL ILE 

ASN ALA LYS GLN GLN ASN ALA THR GLN HIS 

HIS HIS HIS HIS HIS LYS HIS HIS HIS TYR 

LYS PRO TYR ASP THR PRO THR PRO PRO GLY 

PRO ALA THR - ALA LYS PRO PRO ASN GLY 

SER SER — SER SER ARG SER SER GLY LYS 

ASN SER ASN ASN LYS LYS SER PHE ASN ASN 

THR THR PRO PRO SER GLU THR ASN LYS ARG 

18 LYS LYS SER VAL LYS LYS ASP GLU GLU ASP 

— — — — — — TRP — — — 

19 VAL VAL GLN GLN LYS PRO VAL — LYS LEU 

20 ASP ASP ASP GLU GLU PHE ASP SER ASP HIS 
7 LYS 12 LYS VAL LEU ILE LYS ASN ARG VAL VAL 

ARG LYS THR ASP PHE PHE LYS THR PRO PRO 

23 VAL ILE VAL VAL - PRO THR ILE LEU ILE 

26 GLU — — ASN — — PHE LEU PRO PRO 

PRO - — CYS — — SER - - - 

10 LYS 12 — — — — — - - - - 

5 SER 3 — - — - - — — - - 

30 IS CYS 20 — — — — — — — — — 

he sequences are (in order) from human IgGl [Sequence No. 1 in Rabat et al. (1983)], mouse IgGl (No. 45), human IgA (No. 34), mouse 
IgA (No. 60), human IgD (No. 42), mouse IgD (No. 73), human IgE (No. 31), mouse IgE (No. 70), human IgM (No. 38) and mouse 
IgM (No. 65). The numbering scheme (in the leftmost column) is that of Rabat el al. (1983). The total number of contacts in the 
CH 1 : CL pair of KOL are listed on the left and the residue surface areas buried as a consequence of the formation of the KOL CH I : CL 
dimer are listed on the right of some residues in the human IgGl sequence; alongside the mouse IgA sequence are the contacts found 
in the McPC603 CHI :CL pair and the residue surface areas buried in the McPC603 interface. The atomic contacts given here represent 
the totals for each residue and were obtained from Tables 3 and 6. The surface areas are in units of square A J . 



above and the resulting structural alignment was used 
to align the amino acid sequences. In the regions 
where insertions and deletions occur, the sequences 
were aligned visually to maximize homology. The 
other CL and CHI sequences were aligned with those 
of McPC603 and KOL using a version of the pro- 
gram written by M. Murata (Murata et al., 1985) that 
had been modified to use the log odds matrix values 
of Dayhoff et al. (1978) as weights in the amino acid 
comparisons. The alignment was then adjusted to 
ensure that differences in length occurred in the loop 
regions of the domain bilayer structure (see Tables 1 
and 2). 

Surface calculations 

The solvent accessibility of the various structures 
was assessed using program MS of Connolly (1983). 
The solvent accessibility of individual residues was 
computed using program ATMSRF of S. Sheriff 
(Sheriff et al., 1985). The van der Waals atomic radii 
used here were those compiled by Case and Karplus 
(1979); a radius of 1 .5 A was assumed for the solvent 
probe (water). 



Computation of atomic contacts 

The interactions between CL and CHI residues 
were computed using program CONTAX (E. A. 
Padlan, unpublished). Here, a pair of atoms are 
designated as being in contact if they are within 1 .0 A 
of the sum of their van der Waals radii. The atomic 
van der Waals radii used were those compiled by 
Case and Karplus (1979). 



RESULTS AND DISCUSSION 

Tables 1 and 2 show the listing of the interface 
residues aligned with corresponding residues from 
other classes of antibody. Included in Tables 1 and 2 
are the total number of atomic interactions involving 
each residue and the residue surface area buried in the 
CH1:CL interface of the KOL and McPC603 pro- 
teins. The corresponding quantities for NEW and 
J539 are very similar. By and large, the number of 
atomic contacts that a residue makes is paralleled by 
the amount of surface area that is buried as a 
consequence of the formation of the CHI :CL dimer. 
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Table 2. Amino acid sequence of the light chain constant domains of human and mouse lambda and kappa chains 

GLN GLN ARG ARG 7 THR 23 THR SER 7 SER 32 

PRO PRO THR ALA THR GLN VAL 4 TRP 8 

LYS LYS VAL ASP PRO PRO THR 7 THR 22 

ALA SER ALA ALA 2 SER 6 SER GLU ASP 

ALA SER ALA ALA LYS LYS GLN GLN 

PRO PRO PRO PRO 6 GLN 10 GLN ASP ASP 

SER SER SER THR SER SER SER SER 

VAL VAL VAL VAL — — LYS LYS 

1 THR 6 THR PHE 1 SER 11 170 ASN ASN ASP ASP 
LEU LEU ILE 4 ILE 16 ASN ASN SER SER 

42 PHE 66 PHE PHE 60 PHE 81 LYS LYS THR THR 

PRO 4 PRO PRO 6 PRO 24 TYR TYR TYR TYR 

PRO 6 PRO PRO PRO 5 ALA 29 MET SER 7 SER 18 

9 SER 26 SER SER 7 SER 12 6 ALA 14 ALA LEU 10 MET 15 

2 SER 8 SER ASP SER 4 SER 18 SER SER 19 SER 30 
22 GLU 26 GLU GLU 13 GLU 14 SER SER SER SER 

25 GLU 47 GLU GLN 31 GLN 32 27 TYR 53 TYR THR THR 24 

LEU LEU LEU LEU LEU LEU LEU LEU 

GLN GLU LYS THR 180 SER THR THR THR 

ALA 2 THR SER 2 SER 3 LEU LEU LEU LEU 

ASN ASN GLY GLY THR THR SER THR 

4 LYS 3 LYS THR GLY PRO ALA LYS LYS 
ALA ALA ALA ALA GLU ARG ALA ASP 

6 THR 25 THR SER 3 SER 15 GLN ALA ASP GLU 

LEU LEU VAL VAL TRP TRP TYR TYR 

6 VAL 29 VAL VAL 6 VAL 35 LYS GLU GLU GLU 

CYS CYS CYS CYS SER ARG LYS ARG 

29 LEU 47 THR LEU 40 PHE 76 HIS HIS HIS HIS 

5 ILE 10 ILE LEU LEU 190 LYS SER LYS ASN 



140 TYR TYR TYR TYR CYS CYS CYS CYS 

PRO PRO PRO PRO GLN GLN GLU GLU 

GLY GLY ARG LYS VAL VAL VAL ALA 

ALA VAL GLU ASP THR THR THR THR 

VAL VAL ALA ILE HIS HIS HIS HIS 

THR THR LYS ASN GLU GLU GLN LYS 

VAL VAL VAL VAL 200 GLY GLY GLY THR 

ALA ASP GLN LYS — — LEU SER 

TRP TRP TRP TRP - - SER THR 

LYS LYS LYS LYS SER HIS SER SER 

150 ALA VAL VAL ILE THR THR PRO PRO 

ASP ASP ASP ASP VAL VAL VAL ILE 

SER GLY ASN GLY GLU GLU THR VAL 

SER THR ALA SER LYS LYS LYS LYS 

PRO PRO LEU GLU 5 THR SER SER SER 

VAL VAL GLN ARG VAL LEU PHE PHE 3 

LYS THR SER GLN 210 ALA SER ASN ASN 

ALA GLN GLY ASN PRO ARG ARG 8 ARG 16 

GLY GLY ASN GLY THR ALA GLY ASN 

VAL MET SER VAL 12 GLU 21 ASP GLU GLU 

160 24 GLU 33 GLU GLN 12 LEU 42 16 CYS 20 CYS CYS CYS 

THR 5 THR GLU ASN 7 SER SER — — 

The sequences are (in order) from human lambda [Sequence No. 16 in Rabat et al. (1983)], mouse lambda (No. 31), human kappa 
(No. 1) and mouse kappa (No. 23). The numbering scheme is that of Kabat et al. (1983). The total number of atomic contacts in the 
CHI :CL pair of KOL are listed on the left and the residue surface areas buried as a consequence of the formation of the KOL CHI :CL 
dimer are listed on the right of some residues in the human lambda sequence; alongside the mouse kappa sequence are the contacts found 
in the McPC603 pair and the residue surface areas buried in the McPC603 CHI :CI interface. The atomic contacts given here are the 
totals for each residue and were obtained from Tables 3 and 6. The surface areas are in units of A 1 . 



Details of the CH1:CL contacts are presented in 
matrix form in Tables 3-6. In these matrices, each 
element represents the number of pair interactions 
defined as non-bonded interatomic distances of less 
than a certain length between residues across the 
interface. 

In Figs 1 and 2 we see the Fab constant domains 
from McPC603 and from KOL. The corresponding 
structures from J539 and NEW are very similar. The 
domain structure can be regarded as a sandwich or 
flattened cylinder with four strands on one surface 
and three on the other. The interdomain interface is 
formed by the interaction of the two four-stranded 



surfaces. In this interface there are a number of 
amino acid residues that make contact with residues 
of the opposite domain (Tables 3-6). It can be 
observed that there are a few residues on each chain 
that make most of the interchain contacts. In the case 
of McPC603 these include Tyrl22H, Prol23H, 
Leul24H, Phel74H and Prol75H, as well as 
Phell8L, Serl21L, Glul23L, Glnl24L, Phel35L, 
Leul60L and Serl74L. For the most part these 
matrices are very similar for antibodies of the same 
class, i.e. McPC603 resembles J539 (alpha and kappa, 
see Tables 3 and 4), and NEW resembles KOL 
(gamma and lambda, see Tables 5 and 6). 
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Table 3. Contacts between the constant domain* of the light and heavy chains of McPC603 

ypLTLPAL I I LHVNFPAATSQ 
122 123 124 125 126 127 129 130 139 140 146 145 172 173 174 175 177 179 18 6 188 190 

__ ^ 

1117 3 1 

FI18 12 10 23 12 3 

P 1 19 112 2 

S 121 2 5* 

E 123 5 8 

QI24 31 

S 127 2» 

S131 2 1 

V133 6 

F135 3 2 9 3 23 

N 137 1 7 * 

LI60 5 5 2 

S162 2 4* 1 

W163 4 
T164 2 1 2 2 

S 174 2 S 

M175 10 

S176 ' 7 2 

R2I3 7 1 

In this matrix of contacts, the element c(i j) represents the number of interacting atom pairs, one atom from residue i and the other from 
residue j. Atoms are designated as being in contact if the distance between them is within 1 .0 A of the sum of their van der Waals radii. 
The atomic van der Waals radii compiled by Case and Karplus (1979) were used in the computation of these contracts. The one-letter 
amino acid code (DayhofT el at., 1978) is used. The heavy chain residues are across the page and the light chain residues are down the 
page. The residue numbers correspond to those of Rabat tt at. (1983). An asterisk (•) indicates that the contact involves at least one 



For the heavy chain (Table 1), it can be seen that 
certain highly contacting residues are also invariant 
or highly conserved. These include Phel22, Prol23 
and Leu 124 of the first segment of the heavy chain 
together with Leul43, Phel74 and Prol75 in the 
second and third segments. In the light chain (Table 
2), there are several conserved residues in the first 
segment including, in particular, Phel 18, GIul23 and 
Glul24. Other conserved interface residues include 
Thrl31, Vail35 and Thrl62. 

While the above residues present a constant pat- 
tern that might be expected for the interaction of 
CHI and CL, the remaining interface residues are 
quite variable, and are presumably ad hoc con- 

Table 4. Contacts between the constant di 



F 118 
P1I9 
S121 
E123 



tributors to the specificity of the particular combina- 
tion of CHI and CL. Nevertheless, the area that each 
domain contributes to the interface which is excluded 
from solvent is roughly constant at about 500 A 2 (526 
for KOL, 524 for NEW, 518 for McPC603 and 607 
for J539). This may be compared to the solvent 
excluded area of about 700 A J created upon the 
interaction of trypsin with trypsin inhibitors (Janin 
and Chothia, 1976). 

THE INTERFACE CAVITY 

A cavity has been observed in the interface between 
CHI and CL. In the case of McPC603 this is lined by 

mains of the light and heavy chains of J539 



122 123 124 125 126 127 128 139 143 t45 172 173 174 175 177 178 179 186 



FI35 
N137 
L160 
NI6I 
SI62 
W163 
T164 
DI65 
S174 
M175 
S176 
T180 
N212 



(See footnote to Table 3.) 
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Fig. 1. (a) Stereograph of the alpha carbon skeleton of the CHI: CI domains of McPC603. The €L 
domain is on the left. Some residues in each chain are indicated by circles and labelled to serve as reference 
points, (b) The same model with circles to indicate the interacting residues. The radius of each circle is 
proportional to the extent of the interaction us measured by the number of pair interactions that occur 
between atoms of this residue and atoms of the opposite chain. 




Fig. 2{a) tod <b). The same as Fig. 1 but for KOL. 
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Fig. 3. Stereograph of the alpha carbon skeleton of the CH1:CL domains of McPC603 showing the 
location of the cavity (dotted surface) in the interface. The filled circles indicate the residues lining the 
cavity. The orientation is the same as in Fig. 1. 



residues V133, L160, N161, S162, S176 and T178 of 
the light chain and L143, F174, P175, A177, T186, 
Ml 87 and SI 88 of the heavy chain (Fig. 3). The vol 
of this cavity is 143 A 3 , sufficient to accommodate an 
aromatic side chain. Similar cavities occur in J539, 
NEW and KOL, with vols of 146, 70 and 49 A 3 , 
respectively. The cavity is presumably filled with 
solvent, although we have not observed any solvent 
density within the cavity in either McPC603 or J539, 
probably because of the low resolution of the X-ray 
data used in these analyses. It is not clear whether this 
cavity plays a functional role, but a possible role is 
suggested by the variation in the interface residues in 
the different isotypes, namely that it serves to provide 
more flexibility in the interaction between these resi- 
dues. Thus any strain caused by the introduction of 
too bulky a side chain into the interface can be 
relieved by movement of side chains into the cavity. 

This hypothesis was tested by modelling a com- 
posite CHI :CL structure in which the CL was from 
the human KOL lambda chain and the CHI was 
from the mouse McPC603 alpha chain; the KOL CL 
being first maximally aligned with the McPC603 CL 
in the latter's original position relative to CHI (see 
Materials and Methods). Calculation of the atomic 
contact between the domains of this composite struc- 
ture revealed close contacts between the sidegroup of 
Tyrl78 of CL and Serl88 of CHI. Position 178 is 
occupied by either Tyr or Phe in lambda chains and 
by Thr in kappa chains (Kabat et at., 1983); the 
variation at this position represents the most drastic 
and consistent difference between homologous 
lambda and kappa interface residues in terms of size 
(Table 2). Residue 178 lines the interface cavity in 
McPC603 and is therefore placed with plenty of room 
for movement. By turning the sidegroup of Tyr 178 by 
a mere 37° about the CA-CB bond, all the close 
contacts involving this residue were relieved. This 
reorientation positions the sidegroup of Tyrl78 in the 
interface cavity, effectively filling most of the cavity. 
The larger cavity observed in the kappa: alpha pairs 
vs the lambda: gamma pairs can therefore be ac- 
counted for by the necessity for space in order to 
accommodate the threonine to tyrosine change that 
would occur in a kappa to lambda substitution. The 



presence of this cavity then permits this substitution 
to occur without alteration of the mode of associ- 
ation of CHI and CL, this in turn permitting VH and 
VL to adopt their canonical quarternary structure. 

Subsequently, we became aware of a similar 
circumstance that has been observed for T4 
bacteriophage lysozyme (Alber et at., 1985). There a 
mutation that changes Ala 146 (their numbering) to 
threonine propagates a movement of a tryptophan 
residue that in turn causes the movement of the side 
chain of Metl06, which lies in the vicinity of a cavity, 
causing it to move into the cavity. However, whereas 
this result in T4 lysozyme is coincidental, the cavity 
observed in the CHI :CL interface satisfies an evo- 
lutionary requirement of immunoglobulin assembly. 

It would appear that the existence of internal 
cavities may be a natural mechanism for accommo- 
dating mutations which otherwise could cause 
the disruption of the structural integrity of protein 
molecules. 
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